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Abstract 

We have reported that transcription of a hypothetical small open reading frame (orf60) in enteroaggregative £ coli (EAEC) 
strain 042 is impaired after mutation of aggR, which encodes a global virulence activator. We have also reported that the 
cryptic orf60 locus was linked to protection against EAEC diarrhea in two epidemiologic studies. Here, we report that the 
orf60 product acts as a negative regulator of aggR itself. The orf60 protein product lacks homology to known repressors, but 
displays 44-100% similarity to at least fifty previously undescribed small (<10 kDa) hypothetical proteins found in many 
gram negative pathogen genomes. Expression of orf60 homologs from enterotoxigenic E. coli (ETEC) repressed the 
expression of the AraC-transcriptional ETEC regulator CfaD/Rns and its regulon in ETEC strain H10407. Complementation in 
trans of EAEC 042orf60 by orf60 homologs from ETEC and the mouse pathogen Citrobacter rodentium resulted in dramatic 
suppression of aggR. A C. rodentium orf60 homolog mutant showed increased levels of activator RegA and increased 
colonization of the adult mouse. We propose the name Aar (AggR-activated regulator) for the clinically and 
epidemiologically important orf60 product in EAEC, and postulate the existence of a large family of homologs among 
pathogenic Enterobacteriaceae and Pasteurellaceae. We propose the name ANR (AraC Negative Regulators) for this family. 
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Introduction 

Enteroaggregative Escherichia coli (EAEC) is a diarrheagenic 
pathotype linked to traveler's diarrhea, foodborne outbreaks and 
sporadic diarrhea in industrialized and developing countries [1—4]. 
A Shiga toxin-lysogenized EAEC caused a large and highly lethal 
European outbreak in 20 1 1 , highlighting the potential virulence of 
this pathotype [4-6]. 

Virulence gene expression in EAEC is activated in coordinate 
fashion by a regulator called AggR, a member of the AraC/XylS 
family of bacterial transcription factors [7-9] ; AggR is most closely 
related to the CfaD/Rns activator of enterotoxigenic E. coli 
(ETEC) [10], RegA in C. rodentium [11] and PerA in enteropatho- 
genic E. coli (EPEC) [12,13]. AggR, Rns/CfaD, RegA and PerA 
are required for expression of genes mediating the biosynthesis of 
colonization fimbriae and other virulence-associated factors. We 
have recendy reported that AggR regulates the expression of at 
least 44 genes, including itself, at the onset of the logarithmic 
growth phase [14]. 

Among the most dramatically upregulated genes in the AggR 
regulon was a very small cryptic open reading frame (orf; 
designated variously orfBO or orffil, but here only orf60) lacking 



homology to any known gene, but located on the pAA2 virulence 
plasmid nearby to the gene encoding AggR itself [14]. We were 
struck by the high prevalence and conservation of orfBO among 
EAEC strains. In an effort to understand the highly mosaic 
character of the EAEC genome and its relationship to endemic 
diarrhea in developing countries, we scored the presence of AggR- 
dependent and AggR-independent putative virulence genes 
among collections of EAEC strains from children under 5 years 
of age with diarrhea in Mali [15] and Brazil [16]. In both studies, 
we were struck by the strong and consistent negative association 
between the presence of orfBO /61 and diarrhea. Given compelling 
data favoring some important role for cryptic locus orf50/61 in 
EAEC epidemiology, we undertook a thorough investigation of its 
role in pathogenesis. 

We report here that EAEC orf60 encodes a novel protein 
negative regulator, now formally re-named Aar (AggR- 
activated regulator). Although expression of the Aar-encoding 
gene in EAEC is indeed activated by AggR, expression of the 
gene down-regulates expression of AggR itself. Most surpris- 
ingly, we show that Aar is a member of a previouly 
unrecognized large class of regulators in pathogenic Gram 
negative bacteria. 
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Author Summary 

We report here the identification and characterization of a 
new family of negative regulators in Gram-negative 
bacteria, including many pathotypes of diarrheagenic 
Enterobacteriaceae and members of the Pasteurellaceae. 
Members of this regulator family in enteroaggregative 
(EAEC) and enterotoxigenic £ coli (ETEC) and in Citrobacter 
rodentium downregulate the expression of positive regu- 
lator partners AggR, CfaD/Rns and RegA, respectively, all 
members of the AraC/XylS family of regulators. Accord- 
ingly, we propose the name ANR (AraC Negative Regula- 
tors) for this family. ANR members orf60 (termed Aar), 
orf02851 (Rnr), orf0450 and orf01070 (Cnr) from EAEC, C. 
rodentium and ETEC respectively were characterized in this 
study. Deletion of ANR homologs upregulated the expres- 
sion of AggR and RegA in EAEC strain 042 and C. rodentium 
respectively; overexpression of orf60, orf02851, orf0450 
and orf01070 in EAEC strain 042 down-regulated AggR. C. 
rodentium harboring a null mutation in orf02851 exhibited 
a significant increase in expression of the regA and RegA- 
regulated fimbriae. The orf02851 mutant showed higher 
levels of C. rodentium in feces and colonic contents, and 
greater weight loss compared to mice inoculated with the 
wild-type. 

Results 

Altered fimbrial expression in 0A2orf60 

To examine the role of orfBO, a 042orf60 mutant was generated, 
assessed for established EAEC phenotypes and observed by TEM. 
The orfBO mutant was complemented in trans by expressing the 
predicted orf downstream of the arabinose promoter in pBAD30, 
to yield pOrfBO. Since the orf generated two possible protein 
products given translational start from an ATG or alternative start 
codon, the construct included the most upstream (alternative) 
potential start site. 

Surprisingly, 042orf60 was found to be hyper-fimbriated by 
negative staining in TEM (Figure 1, B and E), and readily 
distinguishable from the parent strain (Figure 1, A and D), or the 
042or/60 mutant complemented in trans with the pOrfBO plasmid 
(Figure 1, C and F), both of which showed sparse fimbriae. As 
predicted from the documented role of AAF in aggregation, larger 
bacterial aggregates were visualized by TEM in the 042orf60 
mutant, when compared to the wild type or complemented strains. 
The hyper-aggregative phenotype of 042otf60 was also apparent in 
undisturbed broth cultures (Fig. SI in Text SI, panels A and B). 
We have previously demonstrated that the AAF/II major pilin 
protein AafA binds to fibronectin [17]. To evaluate fimbrial 
function in the absence of orfBO, we assessed this phenotype in 
042orf60 and complemented constructs. As expected, the hyper- 
fimbriated 042orf60 strain exhibited more abundant binding to 
fibronectin-coated plates (~200% binding) compared to the 
parent strain (Figure 1, panel G), while overexpression of orfBO 
in the complemented EAEC strain yielded marked reduction of 
fibronectin binding. 

As predicted from these phenotypes, 042orf60 yielded more 
abundant expression of the major AAF/II fimbrial subunit protein 
AafA than its wild type parent in immunoblot analysis, and this 
phenotype was reversed in the complemented construct (Fig. S 1 in 
Text SI, panel C). We therefore employed qRT-PCR to assess the 
effect of orf60 on aaf fimbrial gene expression. 042 wild type, 
042orf60 and 042o»y6tf(pOrf60) strains were grown in DMEM high 
glucose and samples from log-phase cultures were used for qRT- 
PCR analysis (Figure 1, FT). We observed higher levels of 



expression for fimbrial genes aafA (10- to 25-fold), aaJB (4- to 11- 
fold), aafC (4- to 6-fold) and aaJD (3- to 8-fold) in the 042orf60 
mutant compared with the parent strain, while frimbrial gene 
expression in the complemented strain was nearly undetectable. 

AggR and orf60 are cognate partners in a regulatory 
feedback system 

Our data suggested that orfBO may act as a negative feedback 
regulator of fimbrial gene expression, despite the fact that both the 
fimbriae and orfBO are under AggR activation. We sought to 
dissect features of this regulatory system in EAEC using qRT- 
PCR. Levels of transcription of aggR (Figure 2, A) and the aggR- 
regulated genes aap (Figure 2, B) and aatPABCD (Figure 2, C) were 
compared between 042or/60 and the parent strain by qRT-PCR. 
We observed two-fold higher aggR mRNA levels in 042orf60 
compared to the parent strain (Figure 2, A). Expression of other 
AggR-regulated genes was similarly affected by orfBO deletion, 
including aap, aatP, aatA, aatB, aatC, and aatD (Figure 2, B and C). 
The greatest effect was observed for the aatC gene, expression of 
which was 37-fold higher in the orfBO mutant compared with the 
wild type parent strain (Figure 2, C). Complementation in trans 
with the orfBO gene resulted in undetectable levels of aggR and 
AggR-regulated genes, comparable to the levels seen in EAEC 
lacking the pAA2 plasmid (Figure 2A, B and C). We examined 
similarly aggR and orfBO expression in the 042aggR and 042orf60 
mutant strains during the middle-late log phase of growth (5-8 h). 
aggR expression was consistently high in the absence of orfBO 
(Figure 2, E), whereas orfBO levels were drastically reduced in the 
absence of aggR (Figure 2, F). These data suggest that the effect of 
orfBO on fimbrial gene expression was via a negative feedback 
effect on aggR itself, accompanied by the predicted suppression of 
other genes downstream in the AggR regulon. 

We hypothesized that orfBO may represent a delayed negative 
feedback regulator of aggR. To test this hypothesis, RNA was 
obtained from strain 042 cultures prepared in DMEM high 
glucose at various stages of cell growth, spanning the log and early 
stationary phases (0-8 h of growth), and analyzed for aggR and 
orfBO transcripts (Figure 2, D). As previously reported [18], we 
observed a rapid increase in aggR expression in the first 2 h of 
growth (early log phase), while expression of orfBO was delayed, 
achieving maximal levels after 4 h of incubation (middle log phase) 
(Figure 2, D); at this time point aggR expression was seen to be 
declining, consistent with activation of orfBO by AggR and 
repression of aggR by the orfBO product. At 6 h of growth, both 
aggR and orfBO levels were declining. Interestingly, orfBO gene 
expression reached its lowest expression level at the end of late log 
phase (8 h), while aggR expression was again increasing. Both aggR 
and orfBO were at low levels by 10 hr of growth (Figure 2, D). 

orf60 encodes a regulatory protein 

We sought to determine whether orfBO encodes a small 
regulatory RNA molecule or a regulatory protein. Expression of 
orfBO from pOrfBO was assessed after mutation of three potential 
start codons in the open reading frame: predicted codons 1, 4 and 
5 (illustrated in Figure 3A as Ml, M4, and M5). Plasmid 
derivatives (pOrfBO-Ml, pOrf60-M4, and pOrf60-M5) were used 
to complement the 042orf60 strain. Transcripts for orfBO, aggR and 
aap (Figure 3, B, C and D) were determined in all 042o?jf6"f(pOrf60) 
derivatives. In all strains, high levels of transcripts for orfBO were 
detected by qRT-PCR (Figure 3, B). Replacement of the ATG 
only in position 5 with a stop codon lead to abrogation of the 
orfBO-induced phenotypes, manifested as expression of aggR and 
the AggR-regulated gene aap (Figure 3, panels C, D). The hyper- 
aggregative phenotype of 042orf60 was not reversed when the 
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Figure 1. Increased expression of AAF in Wlorf60. Transmission electron microscopy of negatively stained samples was performed for 042 
(Panels A and D), Q42orf60 (Panels B and E), and 042orf60(pOrf60) (Panels C and F). Samples were visualized at a magnification of x5,000 (Panels A, B 
and C), or x25,000 (Panels D, E, and F). Bacterial adherence to fibronectin-coated plates was quantified as described in Materials and Methods (Panel 
G). Transcript levels for aaf fimbrial genes were measured in 042 (black bars), 042orf60 (gray bars), 042orf60(pOrf60) (open bars) and 042pAA2(-) 
(negative control; striped bars) by qRT-PCR during the log phase of growth (Panel H). Asterisks indicate significant differences by ANOVA (***, P< 
0.0001; *, P<0.05). 

doi:10.1371/journal.ppat.1004153.g001 



strain was transformed with pOrf60-M5, indicating requirement 
of this predicted start codon in orfBO expression (Fig. SI in Text 
SI, panels A,B). 

Our data suggested that orfBO would be transcribed and 
translated as a small protein of 7.23 kDa. In order to observe and 
localize the orfBO protein, bacterial samples were fractionated into 
outer membrane, cytoplasmic /periplasmic fractions, and analyzed 
by Western immunoblot (Figure 3, E). Localization of the product 



at the predicted mass confirmed its existence and localized the 
product to the cytoplasmic/periplasmic fraction of the bacterium. 

Using the Bprom algorithm, a strong sigma-70 bacterial 
promoter region was predicted upstream of the ATG start site; 
predicted —35 and — 10 regions were included in plasmid pOrfBO. 
To confirm that constitutive expression of orfBO was driven by this 
predicted promoter, the araC gene and Pbad promoter were 
removed from pOrfGO plasmid to generate pOrf60-2 (Fig. S2 in 
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Figure 2. Orf60 represses expression of AggR and its regulon. Transcript levels for aggR (Panel A), aap (Panel B) and aatABCDP (Panel C) were 
evaluated in 042 (black bars), 042orf60 (gray bars), 042orf60(pOrf60) (open bars) and 042pAA2(-) (striped bars) strains by qRT-PCR. Transcripts for aggR 
(Panel E) and orf60 (Panel F) were examined in the 042aggR and 042orf60 mutant strains during the middle to the late log phase (5 to 8 h). Expression 
of aggR and orf60 was analyzed in the 042 wild type strain by qRT-PCR over time (Panel D). Asterisks indicate significant differences by ANOVA (***, 
P<0.0001). 

doi:1 0.1 371 /journal.ppat.1 0041 53.g002 
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Figure 3. Orf60 encodes a regulatory protein. Hypothetical start codons TTG (Ml and M4) and ATG (M5) (Panel A), were substituted by stop 
codon TAG in the orf60 sequences of pOrf60. Plasmid derivatives (pOrf60-M1, pOrf60-M4, and pOrf60-M5) were used to complement the 042orf60 
strain. Transcripts for orf60 (panel B), aggR (panel C), and aap (panel D) were determined in all 042o/f60(pOrf60) derivatives by qRT-PCR. Outer 
membrane (OM), and cytoplasmic/periplasmic fractions (C/P) of 042orf60 and 042orf60(pOrf60) were extracted and analyzed by Western blot (Panel 
E). Asterisks indicate significant differences by ANOVA (***, P<0.0001). 
doi:1 0.1 371 /journal.ppat.1 0041 53.g003 



Text SI, panel A). pOrfB0-2 plasmid was transformed into 042 
and its orffiO mutant, and RNA transcripts for orffiO and aggR were 
quantitated by qRT-PCR. High transcriptional levels of orffiO 
were detected by qRT-PCR in strains transformed with pOrffiO 
and pOrffiO-2 (Fig. S2 in Text SI, panel C). 042 derivatives 
transformed with pOrffi0-2 still retained the inhibitory effect of 
orffiO (Fig. S2 in Text SI, panel D). 

High transcriptional levels of orffiO did not compromise the 
fitness of the cell, as manifested by growth curves in DMEM 



medium (Fig. S3 in Text SI, Panel A). To assure that the effects of 
orffiO expression in trans were not due to high copy number over- 
expression artifacts, we cloned the gene into low copy number 
vector a pACYC177. This lower copy number construct restored 
the wild type phenotype (Fig. S3 in Text SI, Panels B, C, D). 

orf60 belongs to a new family of regulators 

Blast analysis of orffiO and its predicted product did not suggest 
significant relatedness to any known bacterial regulatory protein; 



PLOS Pathogens | www.plospathogens.org 



5 



May 2014 | Volume 10 | Issue 5 | e1004153 



New Family of Negative Regulators 



Mass Similarity/ 
(kDa) Identity (%) 



• E. coli M605 


8.73 


48/23.4 


• E. coli O103:H2 str. 12009 ECO103_3133 


8.71 


48/23.4 


E. coli DEC5E ECDEC5E_3204 


8.71 


48/25.5 


E. coli DEC8D ECDEC8D_3471 


6.50 


47/30.6 


E. coli DEC1 1C ECDEC1 1C_3229 


6.50 


47/27.8 


E. coli LT-68 ECLT68_2543 


6.47 


58/27.8 


E. coli 53638 


8.66 


46 / 23.4 


- STEC_H.1.8ECSTECH18_3311 


8.69 


44 / 23.4 


E. coli Fl 1 


8.84 


51/21.3 


E. coli 99.0741 EC970259_2989 


8.67 


51/21.3 


E. coli 1.2264 EC12264_3957 


8.80 


51/21.3 


1 E. coli 2.3916 EC23916_4904 


8.87 


52/37.0 


• E. coli 2.391 6 EC23916 4270 


8.90 


50/35.2 


■ E. coli 0127:H6 E2348C_2635 


8.91 


50/37.0 


C. rodentium ROD_02851 


8.91 


52/37.0 


E. coli EC40967_2945 


8.97 


50/37.0 


E. coli DEC6C ECDEC6C_2140 


9.02 


49/35.2 


- E. coli 541-15 EC541 15_10486 


8.20 


44 / 29.3 


" E. coli 1.2264 EC12264_A0187 


7.53 


54 / 34.0 


■ STEC_EH250 ECSTECEH250_4977 


7.15 


51/31.5 


■ H. haemolyticus M21639] 


8.65 


49/25.5 


- P. bettyae CCUG 2042 


8.67 





H. influenzae CGSHiAA_00225 
H. paraphrohaemolyticus HMPREF1054_1254 
H. parainfluenzae Haemo_02727 
H. haemolyticus GGE 2 1 62 
H. influenzae CGSHi6P18Hl_08165 
H. aegyptius HMPREF9095_0556 

IH. influenzae CGSHiAA_02626 
H. aeg>pri«sHMPREF9095_1993 



8.55 52/24.6 

7.85 56/39.6 
7.88 56/41.5 



7.73 55/35.2 



H. 



iMPREF9952_0719 



4 



rC 



M. haemolytica COK_1886 
M. haemolytica COI_1095 
M. haemolytica COK_0571 
P. multocida AAUPMG_1 1711 

Pantoeasp. GM01 PMI17_05078 

A. actinomycetemcomitans D17P3_2411 

A. Actinomycetemcomitans RHAA1_10936 
A. aphrophilus ATCC33389_2279 
H. haemolyticus HMPREF1053_0859 
H. influenzae CGSHiAA_0952 

EAEC 55989 pEC55989_0067 
E. coli 4_1_40B ESBG_05104 
O104:H4 HUSEC41 p55989_24840 
EAEC 042 t)AA060 
ETEC H10407 p948_0450 
ETECH10407 p948_1070 
ETEC1392/75 p746_052 

ETEC1392/75pl018_093 



8.32 


51/29.8 


8.26 


53/29.8 


9.04 


55 / 33.3 


6.82 


55 / 35.3 


7.08 


50/35.4 


8.14 


53/33.9 


7.85 


55 / 35.7 


8.17 


50/35.2 


8.29 
8.24 


50/33.3 
49/33.3 


8.43 


100/100 


7.60 


100/ 100 


4.36 


100/100 


7.23 


100/ 100 


7.45 


80/66.1 


7.49 


80/71.4 


7.84 


83 /62.5 


9.54 


72/55.9 



w 
a 

n 
w 

rs W E 

3 60 a 

S- H -• 

§ 2<f 
a Ci ^ 

O w 

w 

M 
O 

W 

n 



Pasteurella spp 



)■ Haemophilus spp 



J Mannheimia spp 

Pasteurella spp 
Pantoea spp 

L Aggregatibacter spp 



l Haemophilus spp 



C/3 
= 



PLOS Pathogens | www.plospathogens.org 



6 



May 2014 | Volume 10 | Issue 5 | e1004153 



New Family of Negative Regulators 



Figure 4. orf60, a new family of regulator proteins. Phylogenic analysis was performed for orf60 homologs by using the ClustalW algorithm. 
The protein molecular mass and percent identity and similarity to EAEC orf60 were determined for each predicted protein. 
doi:1 0.1 371 /journal.ppat.1 0041 53.g004 



however, the analysis did identify nearly 50 hypothetical proteins, 
present in 7 major genera/ species: Escherichia coli, Citrohacter spp., 
Haemophilus spp., Pasteurella spp., Mannheimia spp., Pantoea spp. and 
Aggregatibacter spp. (Figure 4). Like orf60, all predicted products of 
this family of hypothetical proteins exhibited low predicted 
molecular mass (39 to 80 amino acids, 4.36-9.54 kDa), and 
exhibited 44—100% similarity to orffiO (Figure 4). 

Haemophilus orffiO homologs were mainly found in pathogenic 
strains including H. influenzae, H. paraphrohaemolyticus, H. parain- 
Jluenzae, H. aegypticus, and H. pittmaniae subspecies [19,20]. The 
predicted protein masses of orfBO homologs in Haemophilus sp. 
varied from 7.73 to 8.65 kDa and exhibited 49 to 56% amino acid 
similarity to the EAEC orf60 product (Figure 4, dark gray boxes). 
orfBO homologs were also found in pathogens Pasteurella multocida 
and Pasteurella bettjae [21]; these predicted proteins displayed 54% 
similarity to orf60 with sizes between 6.82 to 8.67 kDa (Figure 4). 
OrffiO homologs with 51 to 55% similarity and masses between 
8.26 to 9.04 kDa were also identified in Mannheimia haemolytica, the 
agent of bovine respiratory disease complex [22]. 

OrffiO homologs in Aggregatibacter actinomycetemcomitans and 
Aggregatibacter aphrophilus exhibit 50 to 55% similarity to EAEC 
orffiO and their masses range from of 7.85 to 8.17 kDa. OrffiO 
homologs are also found in Pantoea spp., a Gram-negative 
bacterium associated with infections in neonates [23]. The orffiO 
homolog in Pantoea is a 7 kDa protein that shows 50% similarity to 
EAEC orffiO. 

orf60 homologs of pathogenic £ coli strains 

A phylogenetic subgroup of orffiO homologs was found among 
pathogenic E. coli genomes, subdivided into two main subfamilies 
(SFs), SF1 and SF2 (Figure 4). SF1 comprised the orffiO homologs 
found in enterohemorrhagic E. coli (EHEC) strain (12009, 
DEC5E, DEC8D), enteropathogenic E. coli (EPEC) (E2348/69), 
Shiga-toxin producing E. coli (STEC) (DEC11C, H.1.8, EH250, 
99.0741, 1.2264, 2.3916), enterotoxigenic E. coli (ETEC) (LT-68), 
extra-intestinal pathogenic E. coli (ExPEC) (Fl 1), enteroinvasive E. 
coli (EIEC) (53638); homologs were identified among additional E. 
coli strains designated M605, EC40967, DEC6c and 541-15. 
OrffiO homologs in the SF1 subfamily share 44 to 58% amino acid 
identity across the full length of the predicted proteins (Figure 4, 
light gray box). SF2 comprised homologs among EAEC (55989), 
ETEC (H10407, and 1392/75), and Shiga-toxin producing EAEC 
(HUSE41) strains; the archetype orffiO from 042 was assigned to 
this SF, all members of which shared 72-100% amino acid 
similarity (Figure 4). 

Among the most noteworthy predicted proteins was 
ROD_02851 (8.9 kDa) from C. rodentium, sharing 52% similarity 
to orffiO (Figure 4). C. rodentium is an enteric pathogen which 
pursues a pathogenic strategy similar to that of EPEC and STEC, 
and which harbors both an AggR homolog (RegA) and an orffiO 
homolog. 

We examined the genetic organization of the orffiO homologs 
among the diarrheagenic E. coli strains EAEC, ETEC, Shiga-toxin 
producing EAEC and C. rodentium. Unexpectively, in all the 
analyzed EAEC and ETEC genome sequences, four highly 
conserved features were found: a fimbrial operon, a dispersin- 
like protein, an Aat translocator system for dispersin, and an AraC 
family transcriptional regulator. Whereas the precise location of 
the genes varied, all were found to be located close to and in 



divergent organization up or downstream of AraC activators 
(Figure 5, A-G). 

Most interesting was the case of archetype ETEC strain HI 0407 
[24]. This strain has been described as having two copies of cfaD I 
rns, which encodes the AraC homolog most closely related to 
AggR; one of the copies exhibits a frameshift mutation and is 
therefore considered to be inactive [25] . We found orffiO homologs 
divergendy arranged in close proximity to both cfaD genes 
(Figure 5, E). Another ETEC strain, 1392/75, harbors two orffiO 
genes, located in the p746 (p746_052) and pl018 (pl018_093) 
plasmids (Figure 5, F, G). In contrast to EAEC and ETEC, the 
orffiO homolog of C. rodentium was located within a predicted 
phage-encoded region, flanked by flagellar operon genes and the 
Aat translocator system (Figure 5, H). 

Conservation of function would presumably dictate conserva- 
tion of orffiO protein structure across the family. To test this 
hypothesis, we assembled an alignment of orffiO family members 
and applied a variety of prediction algorithms to reveal the 
presence of conserved predicted secondary structure. PRO- 
MALS3D algorithm strongly predicted the presence of three 
consensus alpha helices at conserved locations in the family (Fig. 
S4). Several amino acid residues were highly conserved in identity 
or character across the family; these included a very highly 
conserved cysteine at position 60 in the EAEC orffiO protein. 

orf60 homologs trans-complement Q42orf60 phenotype 

We sought to address whether orffiO homologs from other 
diarrheagenic strains would be able to rescue orffiO function in the 
042orf60 mutant. For these experiments we chose orffiO homologs 
from SF1 (C. rodentium, ROD_02851) and SF2 (ETEC strain 
H10407, p948_0450 and p948_1070), exhibiting similarities of 52 
and 80% to orffiO respectively (Fig. 4). 

We transformed 0A-2orf60 with pOrf0450, pOrfl070, and 
pOrf02851; strains were grown in DMEM high glucose, then 
proteins and RNA were analyzed by western blot using anti-AafA 
antibody and qRT-PCR, respectively. AafA protein expression in 
042 was completely abolished when complemented with any of the 
orffiO homologs (Fig. 6B). Similarly, aggR expression was 
suppressed in strains complemented with any of the ETEC or C. 
rodentium orffiO homologs (Fig. 6A). Thus, orffiO homologs from two 
different subfamilies were capable of complementing orffiO 
function in 042. 

ETEC orf60 homologs repress CfaD/Rns and its regulon 

We hypothesized that the orffiO homologs from ETEC acted as 
regulators of the AggR homolog Rns/CfaD. ETEC HI 0407 was 
transformed with pOrf0450 and pOrfl070, grown and prepared 
for qRT-PCR analysis as above. In the presence of the orffiO 
homologs, we observed significant reduction in expression of cfaD 
itself (3- to 8-fold), as well as the CfaD-activated fimbrial genes, 
cfaA (2- to 9-fold), cfaE(2- to 13-fold), cfaC (4- to 12-fold) genes, and 
the gene encoding the dispersin-like protein CexE (11- to 24-fold) 
(Fig. 7, A, B). Interestingly, the orfl070 gene repressed cfaD and 
CfaD-regulated genes more strongly than did orf0450. 

C. rodentium orf60 homolog repress RegA and its regulon 

We sought to determine if the orffiO homolog of C. rodentium 
(ROD_02851) is a negative regulator of the AggR homolog RegA 
(Fig. 8). C. rodentium was transformed with pOrf02851K, grown 
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Figure 5. Genetic organization of orf60 homologs. DNA sequences for EAEC 55989 (35,013-59,068; GenBank NC_01 1752.1) (Panel A), £ coli str. 
4_1_40B (2,453-22,047, 75,374-80,019; GenBank NZJH660546.1) (Panel B), £ coli O104:H4 str. 01-09591 HUSEC41 (36,093-59,106; GenBank 
NZ_AFPS01 0001 02.1) (Panel C), EAEC 042 (4,520-48,410; GenBank FN554767.1) (Panel D), ETEC H10407 (34,686-45,807, 71,734-91,556; GenBank 
NC_017724.1) (Panel E), ETEC 1392/75 (4,965-26,649; GenBank NC_014234.1 and 51,218-86,537; GenBank NC_014232.1) (Panel F and G) and C. 
rodentium (274,826-379,753; GenBank NC_013716.1) (Panel H) were examined. Orf60 alleles (red arrows), AraC/XylS homologs (blue arrows), fimbrial 
operons (checkered arrows), and dispersin/Aat translocator systems (black arrows) are indicated. 
doi:10.1371/journal.ppat.1004153.g005 
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Figure 6. orf60 homologs from ETEC and C. rodentium repress AggR. 042orf60 was complemented in trans with orf60 homologs from ETEC 
(p948_0450 and p948_1070) and C. rodentium (ROD_02851). Expression of AafA was evaluated by SDS-PAGE and Western blot (Panel B) using anti- 
AafA antibody. Transcripts for aggR were evaluated in the strains by qRT-PCR (Panel A). Asterisks indicate significant differences by ANOVA (***, P< 
0.0001). 

doi:1 0.1 371 /journal.ppat.1 0041 53.g006 



and prepared for qRT-PCR analysis as above. In the absence of 
orf02851, we observed a significant increase in expression of regA 
(~10- to 12-fold) (Fig. 8, B), as well as the genes regulated by 
RegA: fimbrial genes kfiC, kfcE and kfcH (4- to 1 1-fold) (Fig. 8, C), 
dispersin (~80-fold) and the aat system (20- to 30-fold) (Fig. 8, D). 
Complementation in trans down-regulated the expression of regA 
and RegA-regulated genes (Fig. 8, B, C, D). Of note, mutations in 
the Kfc pilus genes were found to demonstrate early loss of 
colonization by C. rodentium [11]. 

Virulence of the C. rodentium orf02851 mutant in the 
C57BL/6 mouse model 

To examine potential effects of orf60-homolog orf02851 on 
virulence, groups of five mice were inoculated with the wild-type 



C. rodentium strain and the orf0285 1 mutant. Although both strains 
were recovered at high levels in stools collected from days 1 to 15 
(Figure 9A), orf02851 mutant showed sustained high levels of 
bacteria (10 7 to 10 10 ) for more than 1 1 days (day 3 to 15) (Figure 9, 
A), whereas the wild-type strain exhibited a significant decrease in 
bacterial shedding levels after day 12 (Fig. 9, A). Only two mice 
out of five were shedding the wild type strain by day 16 and 17 
(Table S3 in Text S2), while high levels of bacteria were detected 
in feces of mice inoculated with the orf0285 1 mutant (at levels of 
10 7 to 10 8 per gram of stool) (Table S3 in Text S2). Consistent 
with levels of shedding in the stools, we observed higher levels of 
challenge strain colonization in the colonic lumen in mice fed the 
wild type strain compared with those fed the orf02851 mutant 
(Fig. 9C). Mice inoculated with orf()2851 strain also showed greater 
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Figure 7. Repression of cfaD/rns and its regulon by ETEC-orf60 homologs. Transcripts for cfaD/rns, its fimbrial operon {cfaA, cfaE, and cfaQ 
(Panel A), and dispersin-like cexE (Panel B) were evaluated in ETEC H10407 (black bars), H10407(pOrf0450) (gray bars) and H10407(pOrf1070) (open 
bars) strains by qRT-PCR. Asterisks indicate significant differences by ANOVA (***, P<0.0001; **, P<0.001; *, P<0.05). 
doi:1 0.1 371 /journal.ppat.1 0041 53.g007 
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Figure 8. Repression of regA and its regulon by orf60 homolog-orf02851. Transcripts for orf02851 (Panel A), regA (panel B), fimbrial operon 
encoding genes (kfcC, kfcE, and kfcH) (Panel C), dispersin-like gene and aat system (Panel D) were evaluated in C. rodentium (black bars), orf02851 
mutant (open bars) and orf02851(pOrf02851K) (gray bars) strains by qRT-PCR. Asterisks indicate significant differences by ANOVA (***, P<0.0001; **, 
P<0.001; *, P<0.05). 
doi:1 0.1 371 /journal.ppat.1 0041 53.g008 



weight loss compared to mice inoculated with wild-type (Fig. 9B). 
Transmission electron microscopic examination of colonic tissues 
revealed more intense cytoplasmic vacuolization in animals 
infected with the mutant strain compared with wild type, 
accompanied by greater amounts of luminal debris (Fig. 10). 

Discussion 

Many microorganisms utilize members of large families of 
regulators, presumably tailoring them to their pathogenetic 
idiosyncrasies. The large AraC/XylS family of gram-negative 
transcriptional activators is defined by a 100-amino-acid region of 
sequence similarity that contains two helix-turn-helix (HTH) DNA 
binding motifs [7,9]; a number of family members are involved in 
carbon metabolism, responses to environmental stress, and many 
serve as virulence gene regulators [7,9]. Several virulence-related 
AraC-family activators, including AggR of EAEC, have been 
shown to exercise autoactivation, presumably in order to assure 
very rapid activation of their regulons upon entry into an 
environment conducive to pathogenic lifestyles. Interestingly, 
however, for few autoactivators has a very simple engineering 
question been plausibly addressed: how does an autoactivator 
check runaway positive feedback? Unrestrained activation could 
be maladaptive in terms of optimal bacterial metabolism, or could 
conceivably render the bacterium more virulent than is fitting for 
optimal persistence in the host population. For these reasons, 



mutations in negative feedback regulators of autoactivators could 
engender increased or decreased virulence. 

We suggest here the term Aar (AggR-activated regulator) to 
describe the orfbO gene product, and propose the term AraC- 
family negative regulators (ANR) to describe its family. The 
precise contribution of Aar in pathogenesis remains unknown. It 
could act by simply preventing excessive action of its activator 
partner, and/or could assure finer levels of on-demand gene 
expression at precise times or places in the setting of pathogenesis. 
Alternatively, Aar could act directly or indirectiy as a virulence 
suppressor, perhaps to modulate virulence as a result of selection 
towards clinical attenuation. Our finding of increased pathoge- 
nicity in oar-negative strains in Mali and Brazil would support this 
latter hypothesis [15,16]. Although the Aar-homolog mutation in 
C. rodentium yielded a predicted effect (enhanced intestinal 
colonization), it is unclear whether all Aar homologs act as 
virulence modulators. 

The mechanism by which orfBO checks aggR expression is not 
apparent from our analysis. Alignment of the orfBO family revealed 
conserved motifs: strongly predicted alpha helical structure with 
several highly conserved residues (Figure S4). Alpha-helical 
dominance is found in several DNA binding proteins [26], 
suggesting that this function could be provided by orfBO. 
Alternatively, orffiO could bind AggR, preventing its interaction 
with the DNA upstream of its activated promoters. A third 
possibility entails action of orf50 via an intermediate regulator. 
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Figure 9. Colonization of C57BL/6 mice with derivatives of C. rodentium. Bacterial shedding was quantified in 5 mice at selected time points 
after inoculations (Panel A). Mice received orally 10 10 CFU of C. rodentium (0) and orf02851 mutant (A) strains. As negative control 5 mice were 
inoculated with PBS (x). The mice were weighed daily and weigh loss was determined for each group (Panel B). Bacterial colonization in intestinal 
tissue was quantified in mice at day 9 post-inoculation (Panel C). Asterisks indicate significant differences by ANOVA (*, P<0.05). 
doi:1 0.1 371 /journal.ppat.1 0041 53.g009 
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Figure 10. Transmission Electron Microscopy of the intestinal 
tissues of mice infected with C. rodentium or its orf02851 
mutant. Groups of 5 mice were inoculated orally with 10 10 CFU of C. 
rodentium (Panel B) and orf02851 mutant (Panel C) strains. Five mice 
were inoculated with PBS as control (Panel A). Mice were euthanized on 
day 9 and colonic tissues were prepared for TEM analysis. White and 
black asterisks indicate cellular vacuolization and debris observed in 
orf02851 mutant respectively (Panel C). Samples were visualized at a 
magnification of x3,000 (Panel A), or x5,000 (Panels B and C). 
doi:1 0.1 371/journal.ppat.l 0041 53.g01 0 

These three, and other possible hypotheses are under investiga- 
tion. 

The existence of antivirulence genes is attracting increased 
attention. Early work on this area suggested the loss of basic 
metabolic genes whose activities interfered with virulence func- 
tions (e.g. cadA of Shigella spp.) [27]. Some pathogens express 
protein effectors that act to suppress virulence, often by 
diminishing the inflammatory response (e.g. Pic from Shigella 
flexneri) [28]. However, to our knowledge, Aar represents the first 
example of a dedicated pathogen-specific attenuator of virulence 
genes in gram-negative bacteria. Additional investigations are 
required to understand the role of Aar and its homologs in the 
setting of disease. 

Materials and Methods 

Bacterial strain and growth conditions 

Bacterial strains used in this study are shown in table S 1 in Text 
S2. Strains EAEC 042, 042aggR, and Q4:2aggR(pBADaggR) were 
previously described [18,29,30]. Bacterial cultures were routinely 
propagated in Luria Broth (LB) and Dulbecco's modified Eagle's 
medium with 0.4% glucose (DMEM high glucose) (Gibco, Grand 
Island, NY) as previously described [14]. Mutagenesis of orfBO in 
042 and C. rodentium strain ROD_02851 was accomplished by 
using lambda red technology [31]. The locus (46,236-46,438, 
GenBank FN554767.1) in 042 and (319,732-319,967, GenBank 
NC_013716.1) in C. rodentium were replaced with the kanamycin 
(km) resistance marker as previously reported [3 1] . 042orf60 and C. 
rodentium orf02851 strains were identified by PCR using specific 
primers for orfbO, orf0285 1 and a km resistance marker (Table S2 
in Text S2). We isolated a spontaneous pAA2 plasmid-cured 
derivative of strain 042 for use as a negative control in several 
assays. 

Complementation of 042orf60 and C. rodentium orf02851 

For the complementation of 042orf60, plasmids pOrf60, 
pOrf0450, pOrfl070 and pOrf2851 were generated in this study 
(Table SI in Text S2). Briefly, a 328-kb fragment encompassing 
the orf60 gene and its predicted promoter (see Fig. S2 in Text SI) 
was amplified by PCR (spanning nucleotides 46,141-46,433 in 
GenBank accession FN554767.1), and cloned into the EcoBA and 
Xbal sites of pBAD30; the resulting plasmid was designated 
pOrf60. pOrf60-2 was generated by removing araC and the Pbad 
promoter from the plasmid vector backbone (Fig. S2 in Text SI). 
The orffiO gene with native promoter was generated by PCR 
(pACYC primers) (Table S2 in Text S2) and cloned into the 
Hindlll/Smal site of vector pACYC177 (GenBank X06402) 
(Figure. S3 in Text SI). 

To express orf60 alleles from other bacteria, the EAEC orffiO 
gene was deleted by reverse PCR of pOrf60 so as to preserve the 
original EAEC orfBO translational start codon; the respective 
heterologous allele was fused downstream to assure transcription 
and translation at levels similar to EAEC. orfbO homologs 
from ETEC (p948_0450, genomic region 43,990-44,184 and 
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p948_1070, genomic region 88,912-89,109, GenBank 
NCJ317724.1) [32] and C. rodentium (ROD_02851, genomic 
region c3 19966-3 19733, GenBank NC_0 137 16.1) [33] were 
employed to generate plasmids pOrf0450, pOrfl070, and 
pOrf2851 respectively. For the complementation of C. rodentium 
orfU2851, the pOrf2851K plasmid was generated by inserting an 
encoding km r marker into pOrf285 1 backbone. 

Site directed mutagenesis of orf60 start codons 

Stop codons (TAG) were introduced in the three hypothetical 
alternative or canonical start codons for orffiO ((TTG-position 1), 
46,234-46,236; (TTG-position 4), 46,243-46,248 and (ATG- 
position 5), 46,246-46,248, (GenBank FN554767.1)) by using the 
QuikChange II XL Site-Directed Mutagenesis Kit (Agilent 
Technologies, CA USA). As a template for the mutagenesis 
reaction, 50 ng of pOrffiO plasmid was mixed with 125 ng of the 
corresponding primers (L1F-L1R; L4F-L4R and M5F-M5R) 
(Table S2 in Text S2) and 2.5 U/ul of Pfu Turbo DNA 
polymerase. Samples were treated for 1 cycle/3 minutes at 95°C 
followed by 18 cycles of95°C/30 seconds, 55°C/1 minute, 68°C/ 
8 minutes. Samples were digested with Dpril for 2 h at 37°C and 
transformed into XLIO-Gold Ultracompetent Cells (Agilent 
Technologies, CA USA). Plasmid derivatives were designated 
pOrf60-Ml, pOrf60-M4, and pOrf60-M5 respectively. All con- 
structs were verified by nucleotide sequencing at the University of 
Virginia DNA Science Core. 

Real-time quantitative reverse transcription-PCR (qRT- 
PCR) 

Overnight bacterial cultures of EAEC were diluted 1:100 into 
13 ml of DMEM high glucose (aggR-mducing conditions), and 
incubated at 37°C without shaking for the times indicated. For 
ETEC and C. rodentium, bacterial cultures were diluted 1:100 in 
13 ml of LB or DMEM-LB broth respectively. Extraction of RNA, 
cDNA synthesis and qRT-PCR assays were performed as 
previously described [14]. Primers for EAEC were previously 
published [14]. Primers for ETEC and C. rodentium genes are 
reported in Table S2 in Text S2. Reactions were run in 
experimental duplicate using two independent cDNA prepara- 
tions. Expression levels for each queried gene were normalized to 
the constitutively expressed cat gene in EAEC 042, rpoA in ETEC 
HI 040 7 and rpoD in C. rodentium as previously described 
[14,34,35]. 

Transmission Electron Microscopy (TEM) 

Strains were grown in LB overnight with shaking at 37°C, then 
diluted 1:100 in 5.0 ml of DMEM-high glucose, and incubated 
with shaking to reach OD 600 of 0.8. Samples were prepared for 
negative staining with 2% uranyl acetate as previously described 
[14]. Colonic sections of mice infected with C. rodentium derivatives 
were analyzed by TEM. Briefly, groups of five C57BL/6 mice 
were inoculated with 10 CFU of C. rodentium derivatives. Mice 
were euthanized at day 9 post-infection and descending colonic 
tissue was dissected and prepared for TEM analysis. The TEM 
samples were viewed on a Jeol JEM 1230 Transmission Electron 
Microscope (80 kV) at the Advanced Microscopy Laboratory at 
the University of Virginia (AML-UVA). 

Detection of AAF/II and orf60 proteins 

For detection of EAEC Aggregative Adherence Fimbriae II 
(AAF/II), strains were grown in 13 ml of DMEM high glucose to 
reach an OD 600 of 0.8. Bacteria were pelleted, resuspended in 
100 uj of 0.5 mM Tris, 75 mM NaCl and heated for 30 min at 



65°C. The major pilin subunit of AAF/II (AafA) was analyzed in 
the supernatant by SDS-PAGE and Western blot analysis. For 
detection of orf60 protein, outer membrane (OM) and cytoplas- 
mic /periplasmic (C/P) fractions of E. coli 042orf60 and 042orf60(- 
pOrf60) were prepared as described [36]. Protein samples were 
separated in acrylamide gels and transferred to Immobilon-P 
membranes (BioRad, Hercules CA, USA) by using standard 
protocols. The membranes were incubated overnight with anti- 
AafA or anti-orffiO antibodies respectively. The next day, the 
membranes were washed twice in PBS-0. 1 % tween, and incubated 
for 1 h with a horseradish peroxidase-conjugated goat anti-rabbit 
IgG antibody. Membranes were developed by using TMB 
Membrane peroxidase substrate (KPL, Gaithersburg, MD, USA) 
following the manufacter's specifications. 

AAF phenotype assays 

Fibronectin-binding was assessed as previously reported [17]. 
The autoaggregation assay was performed as previously described 
(13). 

Bioinformatic and statistical analysis 

Hypothetical proteins were analyzed by using Clustalw algo- 
rithms (http://www.genome.jp/tools/clustalw/). Protein mass, 
secondary structure and similarity was determined in predicted 
proteins by using bioinformatic tools (http://www.bioinformatics. 
org/sms/prot_mw.html, http:/ /www.ch.embnet.org/software/ 
LALIGN_form.html and http:/ /prodata.swmed.edu/promals3d/ 
promals3d.php. —35 and —10 regions of the P 0 jf&o promoter were 
predicted by using Bprom algorithm (http://linuxl.softberry. 
com/cgi-bin/programs/gfindb/bprom.pl). Statistical analysis of 
the data for fibronectin binding, fecal shedding and gene 
expression was performed by using the GraphPad Prism 6 
(GraphPad Software, Inc., CA, USA). The statistical significance 
of the differences in the sample means was calculated by using 
ANOVA with post hoc Tukey's correction. Results were 
considered significant at P<0.05. 

C. rodentium mouse model 

All animal work has been conducted according to relevant 
national and international guidelines. Four to five week old male 
C57BL/6 mice were inoculated with 200 u.1 of a bacterial 
suspension containing 10 CFU in PBS using a feeding needle. 
The mice were weighed daily and fecal pellets were collected 
aseptically from each mouse. The number of viable bacteria per 
gram of feces was determined by plating serial dilutions of the 
samples onto media containing appropriate antibiotics. For 
bacterial colonization, groups of 8-10 mice were inoculated with 
containing 10 10 CFU in 200 ul PBS. Mice were euthanized at day 
9 post-infection and bacteria in the large intestinal lumen was 
quantified as described above. 

Ethics statement 

Animal experiments were performed in accordance with the 
Guide for the Care and Use of Laboratory Animals of the National 
Institutes of Health and with the permission of the American 
Association for the Assessment and Accreditation of Laboratory 
Animal Care. The protocol was reviewed and approved by the 
Institutional Animal Care and Use Committee of the University of 
Virginia (Protocol No. 3894). 

Accession numbers 

Presented are database accession numbers for genes identified 
by BLAST search and presented in Fig. 4. Numbers are listed in 
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order of similar 
YP_006099 179.1; 
ZP_10060736.1; 
YP_006203907.1; 
ZP_1 2482356.1; 
ZP_14441355.1; 
YP_002330 134.1 
ZP_12052353.1; 
ZP_1 2042956.1; 
ZP_03035517.1; 
ZP_1 1990461.1; 
ZP_1 1999917.1; 
ZP_1 3759700.1; 
ZP_03000778.1; 
ZP_1 1504183.1; 
gb|EFA27649.1; 
gb|EGT78764.1; 
gb|EGY30032.1; 
gb|EHK89847.1; 
ZP_0 1789986.1; 



ity to or!60 from strain 042: 
gi 1 2 185 1 1 2 1 5, | YP_0024 1 5673. 1 ; 
gi| 387610422, YPJ306203859.1; 
gi|299836136, YP_003717705.1; 
gi 1 298206490, YPJXJ37 17592.1; 
gi| 417147810, ZP_1 1988310.1; 
gi| 283784065, YP_003363930.1; 
gi|419153984, ZP_1 3698552.1; 

ZP_12047171.1; 
ZP_1 3682492.1; 
ZP_08349390.1; 
YP_003223018.1; 
ZP_1 2274849.1; 



gi 1 417259639, 
gi 1 4191 37699, 
gi 1 331648301, 
gi 1 260845240, 
gi 1 41 7624551, 
gi 1 41930 1338, 
gi 1 41 76 15862, 
gi| 338217140, 
gi| 341956103, 
gi 1 348653724, 
gi 1 386908035, 
i 1 145635267, 



ZP_13843337.1; 
ZP_12266306.1; 
gb|EGP03044.1; 
gb|EGT82542.1; 
gb|EGY69408.1; 
gb|EIJ72734.1; 
ZPJ) 179097 1.1; 



gi 1 387604910, 
gi 1 386283102, 
gi 1 387610470, 
gij 417835927, 
gi 1 419923404, 
gi 1 215487703, 
gi|417264971, 
gi|417251191, 
gi 1 191174000, 
gi 1 41 7 150722, 
gi|417166561, 
gi|419216701, 
gi 1 188493508, 
gi|415811870, 
gi| 270315363, 
gi 1 341952232, 
gij 347813358, 
gij 359755683, 
gi 1 145634275, 
gi 1 145635897, 



ZPJ 179 1585.1; gi| 229844281, ZP_04464421.1 
ZP_05988704.1; gi| 261493480, ZP_05990003.1 
ZP_05991771.1; gi| 329122764, ZP_08251338.1 
ZP_08252775.1; gij 343518941, ZP_08755927.1 
ZP_09 184883.1; gi| 3863891 16, ZP_10073947.1 
ZP_10127061.1; gi| 398801995, ZP_10561225.1 



gi|261492133, 
gi 1 261495320, 
gi 1 329124231, 
gij 359299044, 
gij 387770888, 
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